Classification Using Jumping Emerging Patterns and Cosine Similarity

نویسندگان

  • Mauri Ferrandin
  • Adão Boava
  • Alex Sandro Roschildt Pinto
چکیده

Classification is a common task in Machine Learning and Data Mining. Jumping Emerging Patterns have been applied for classification in different contexts with good results and the advantage of to be easily understandable for users. In this work we propose the use of cosine similarity measure to select the patterns which will be used to predict the classes in the classification process. Two versions of the algorithm were proposed and tested with four different parameter values in 21 datasets. The results were compared with three frequently used classification algorithms from the literature and proposed algorithms showed a promising results achieving in some the datasets best results than C4.5

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Arabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents

Besides for its own merits, text classification (TC) has become a cornerstone in many applications. Work presented here is part of and a pre-requisite for a project we have overtaken to create a corpus for the Arabic text process. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...

متن کامل

Detection of Fake Accounts in Social Networks Based on One Class Classification

Detection of fake accounts on social networks is a challenging process. The previous methods in identification of fake accounts have not considered the strength of the users’ communications, hence reducing their efficiency. In this work, we are going to present a detection method based on the users’ similarities considering the network communications of the users. In the first step, similarity ...

متن کامل

SOME SIMILARITY MEASURES FOR PICTURE FUZZY SETS AND THEIR APPLICATIONS

In this work, we shall present some novel process to measure the similarity between picture fuzzy sets. Firstly, we adopt the concept of intuitionistic fuzzy sets, interval-valued intuitionistic fuzzy sets and picture fuzzy sets. Secondly, we develop some similarity measures between picture fuzzy sets, such as, cosine similarity measure, weighted cosine similarity measure, set-theoretic similar...

متن کامل

Improving Semi-Supervised Acquisition Of Relation Extraction Patterns

This paper presents a novel approach to the semi-supervised learning of Information Extraction patterns. The method makes use of more complex patterns than previous approaches and determines their similarity using a measure inspired by recent work using kernel methods (Culotta and Sorensen, 2004). Experiments show that the proposed similarity measure outperforms a previously reported measure ba...

متن کامل

Contrast pattern mining and its applications

The ability to distinguish, differentiate and contrastbetween different data sets is a key objective in datamining. Such ability can assist domain experts tounderstand their data, and can help in buildingclassification models. This presentation will introduce theprincipal techniques for contrasting data sets. It will alsofocus on some important real world application are...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015